PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gh_D04G1941
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family Trihelix
Protein Properties Length: 349aa    MW: 38755.1 Da    PI: 10.1836
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gh_D04G1941genomeNAU-NBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix51.52.6e-1647130186
     trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkm....rergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                  +W++  v  L+ea+++++   +r+klk+++We+v++++      ++  ++++qCk+k+e+++kry+ + +++ +      s++p++ +l+
  Gh_D04G1941  47 EWSEGAVSSLLEAYENKWVLRNRAKLKGHDWEDVARYVsaraNCTKSPKTQTQCKNKIESMKKRYRSESATADG------SSWPLYPRLD 130
                  5*************************************844444455556679****************99997......4699999986 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF138376.8E-2045130No hitNo description
Sequence ? help Back to Top
Protein Sequence    Length: 349 aa     Download sequence    Send to blast
MEKETNQENP SLLSNNNISI TKEDSSPKKH PGNTAAAGGG DRLKRDEWSE GAVSSLLEAY  60
ENKWVLRNRA KLKGHDWEDV ARYVSARANC TKSPKTQTQC KNKIESMKKR YRSESATADG  120
SSWPLYPRLD LLLRGSTAPP PPPLLPPQLQ PSAVPQAATP ISTNPPFMTL PEPSMMVVLQ  180
QQHPPPPPPH LAPQLPGTTQ NSHDSNGIDR IPKEDGAGTK SSGRLSDKIA METDSSTPAL  240
YSDRERPRSK KAKMKIETMA TMMKKKKRRK EECEIGGSIQ WLAQVVLKSE QARMETMKEI  300
EKMRVEAEAK RGEMDLKRTE IIANTQLEIA RLFAGSNKGV DSSLRIGRN
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1263268KKKKRR
2263269KKKKRRK
3264268KKKRR
4264269KKKRRK
5265269KKRRK
Expression -- UniGene ? help Back to Top
UniGene ID E-value Expressed in
Ghi.24910.0boll| ovule| root| stem
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankHQ5274982e-60HQ527498.1 Gossypium herbaceum clone NBRI_C_EYT27PB01AP1QB simple sequence repeat marker, mRNA sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_012459717.10.0PREDICTED: trihelix transcription factor ASIL2
TrEMBLA0A0D2VUE20.0A0A0D2VUE2_G
STRINGPOPTR_0001s06580.11e-140(Populus trichocarpa)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM98592533
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G54390.11e-102sequence-specific DNA binding transcription factors